Long short-term memory: English Wikipedia

Long short-term memory

Long short-term memory (LSTM) is a recurrent neural network (RNN) architecture, a type of artificial neural network, published in 1997 by Sepp Hochreiter and Jürgen Schmidhuber. Like most RNNs, an LSTM network is universal in the sense that, given enough network units, it can compute anything a conventional computer can compute, provided it has the proper weight matrix, which may be viewed as its program. Unlike traditional RNNs, an LSTM network is well suited to learning from experience to classify, process and predict time series when there are very long time lags of unknown size between important events. This is one of the main reasons why LSTM has outperformed alternative RNNs, hidden Markov models and other sequence learning methods in numerous applications. For example, LSTM achieved the best known results in unsegmented connected handwriting recognition,〔A. Graves, M. Liwicki, S. Fernandez, R. Bertolami, H. Bunke, J. Schmidhuber. A Novel Connectionist System for Improved Unconstrained Handwriting Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 31, no. 5, 2009.〕 and in 2009 won the ICDAR handwriting competition. LSTM networks have also been used for automatic speech recognition, and were a major component of a network that in 2013 achieved a record 17.7% phoneme error rate on the classic TIMIT natural speech dataset.
== Architecture ==

An LSTM network is an artificial neural network that contains LSTM blocks instead of, or in addition to, regular network units. An LSTM block may be described as a "smart" network unit that can remember a value for an arbitrary length of time. An LSTM block contains gates that determine when the input is significant enough to remember, when it should continue to remember or forget the value, and when it should output the value.
A typical implementation of an LSTM block is shown in the accompanying figure. The four units shown at the bottom of the figure are sigmoid units (<math>y = s(\sum_i w_i x_i)</math>, where ''s'' is some squashing function, such as the logistic function). The left-most of these units computes a value which is conditionally fed as an input value to the block's memory. The other three units serve as gates to determine when values are allowed to flow into or out of the block's memory. The second unit from the left (on the bottom row) is the "input gate". When it outputs a value close to zero, it zeros out the value from the left-most unit, effectively blocking that value from entering the block's memory. The third unit from the left is the "forget gate". When it outputs a value close to zero, the block will effectively forget whatever value it was remembering. The right-most unit (on the bottom row) is the "output gate". It determines when the unit should output the value in its memory.

The units containing the <math>\Pi</math> symbol compute the product of their inputs (<math>y = \prod_i x_i</math>). These units have no weights. The unit with the <math>\Sigma</math> symbol computes a linear function of its inputs (<math>y = \sum_i w_i x_i</math>). The output of this unit is not squashed, so that it can remember the same value for many time-steps without the value decaying. This value is fed back in so that the block can "remember" it (as long as the forget gate allows). Typically, this value is also fed into the three gating units to help them make gating decisions.
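
The block described above can be sketched in a few lines of plain Python. This is a minimal illustration under stated assumptions, not a reference implementation: the names `lstm_block_step`, `EXAMPLE_PARAMS` and the weight keys are hypothetical, the squashing function is taken to be the logistic function, the weights of the <math>\Sigma</math> unit are fixed to 1, and the example inputs encode three control flags as +1 (on) or -1 (off) so that the gates can saturate without bias terms.

```python
import math


def logistic(z):
    """Squashing function s(z) = 1 / (1 + e^(-z))."""
    return 1.0 / (1.0 + math.exp(-z))


def weighted_sum(weights, inputs):
    """Linear combination sum_i w_i * x_i."""
    return sum(w * x for w, x in zip(weights, inputs))


def lstm_block_step(x, memory, params):
    """Advance one LSTM block by one time-step; returns (output, new_memory).

    x      -- list of external inputs for this step
    memory -- value currently remembered by the block
    params -- dict of weight lists (hypothetical key names); the three gate
              lists carry one extra trailing weight for the fed-back memory
    """
    gate_inputs = list(x) + [memory]  # memory is also fed to the three gates

    # Bottom row: four sigmoid units, y = s(sum_i w_i x_i).
    candidate = logistic(weighted_sum(params["candidate"], x))
    input_gate = logistic(weighted_sum(params["input_gate"], gate_inputs))
    forget_gate = logistic(weighted_sum(params["forget_gate"], gate_inputs))
    output_gate = logistic(weighted_sum(params["output_gate"], gate_inputs))

    # Pi units: plain products of their inputs, no weights.
    gated_input = input_gate * candidate
    kept_memory = forget_gate * memory

    # Sigma unit: linear and not squashed (unit weights are an assumption here).
    new_memory = kept_memory + gated_input

    # Final Pi unit: the output gate decides whether the memory is emitted.
    output = output_gate * new_memory
    return output, new_memory


# Illustrative weights (assumed, not taken from the text). Inputs are
# x = [signal, store, keep, read], with each flag set to +1 (on) or -1 (off).
EXAMPLE_PARAMS = {
    "candidate":   [2.0, 0.0, 0.0, 0.0],
    "input_gate":  [0.0, 20.0, 0.0, 0.0, 0.0],
    "forget_gate": [0.0, 0.0, 20.0, 0.0, 0.0],
    "output_gate": [0.0, 0.0, 0.0, 20.0, 0.0],
}

if __name__ == "__main__":
    memory = 0.0
    _, memory = lstm_block_step([1.0, +1, +1, -1], memory, EXAMPLE_PARAMS)  # store
    for _ in range(50):  # hold: input gate closed, forget gate open
        _, memory = lstm_block_step([-1.0, -1, +1, -1], memory, EXAMPLE_PARAMS)
    output, memory = lstm_block_step([0.0, -1, +1, +1], memory, EXAMPLE_PARAMS)  # read
    print(output, memory)
```

With these weights, a value stored on the first step survives the 50 holding steps almost unchanged, because the memory passes only through the unsquashed linear unit and a forget gate that stays close to one. Setting the keep flag to -1 drives the forget gate towards zero and clears the memory.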

Excerpt source: Wikipedia, the free encyclopedia